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Abstract 

Bernoulli-p thinning has been well-studied for point processes. Here 
we consider three other cases: (1) sequences (-Xi, X2, ■■■)', (2) gaps of such 
sequences (X„+i — A"i)„ 6 n; (3) partition structures. For the first case we 
characterize the distributions which are simultaneously invariant under 
Bernoulli-p thinning for all p £ (0, 1]. Based on this, we make conjectures 
for the latter two cases, and provide a potential approach for proof. We 
explain the relation to spin glasses, which is complementary to important 
previous work of Aizenman and Ruzmaikina, Arguin, and Shkolnikov. 



1 Introduction and main conjecture 

Motivated by the theory of mean field spin glasses, we consider thinning for 
random partition structures. In spin glasses a configuration consists of N spins 
(ox, . . . ,(Jn) in {+1, — 1} and has energy proportional to N. One wishes to 
take N — > 00 to obtain the thermodynamic limit. But each time N increases 
by 1, the number of configurations doubles. Additionally there is a shift in the 
energy of each configuration at least of order 1, because the energy also scales 
as N. 

Both the energy shift and the entropy shift must be considered in order to 
understand equilibrium states from the "cavity" perspective. For the entropy 
part of the dynamics, it is more intrinsic to imagine taking spins away one at a 
time. This leads to the notion of thinning. Thinning has been much studied in 



"This is a revision. The original version was written in May 2011, and we have left the 
affiliations as they were then. 
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the context of random point processes, most notably by Matthes, Kerstan and 
Mecke [9] and Kallenberg |S]. 

For uncorrelated spin glasses, such as Derrida's Random Energy Model 
(REM) [BJ, the natural objects are random partitions structures. These are 
random probability measures, ignoring the underlying structure of the points of 
the sample space. This structure is unimportant for the REM because of the 
IID nature of the energies. 

In an important paper, Ruzmaikina and Aizenman considered competing 
particle systems, from the perspective of energy gaps, motivated by the energy 
shift aspect of the REM li'i ■ This was followed by alternative perspectives by 
Arguin [3] who considered random partitions structures, and Shkolnikov [IB] 
who considered a more general family of energy shifts allowing for lattice type. 

In the present paper we consider the entropy shift from the perspective 
of thinning for random partition structures. We describe this setting next. 
Our interest is to characterize the set of random partition structures which are 
thinning invariant. In particular, we will state a conjecture and support it with 
evidence. 

Notation. We write X ~ /i to mean that A is a random variable with 
distribution /i. We write X _U_Y to indicate that X and Y are independent. 
Given a random variable Z which has been constructed in some way, we will 
write £(Z) for the marginal distribution (or law). 

1.1 Bernoulli-p thinning for partition structures 

Let A denote the set of all £ = (£i, £2, ■ • ■ ) satisfying £1 > £ 2 > • • • > and 
£1 +£2 + ■ ■ • < I- These are the partition structures. With the product topology, 
A is a compact and metrizable space. 

The continuous component of the random measure is £0 := 1— (£1 +£2 + - • • )• 
This is sometimes also called "dust." We define Aqo to be the subset of those 
£ £ A such that one or both of the conditions is satisfied 

• £0 > 0, or 

• \{n : £„ > 0}| = 00. 

We call these infinite partition structures. 

Let AA(A) denote the set of Borel probability measures on A. These are the 
random partition structures. 

Definition 1.1 Given a complete separable metric space X, we always letA4(X) 
denote the set of Borel probability measures on X. 

Let A / J(A 00 ) denote the subset of those fi £ A4(A) satisfying /z(Aoo) = 1. 
These are the random infinite partition structures. 

For each p £ (0, 1], the thinning map is 9 p : _M(Aoo) — > M(A QO ), defined as 
follows. Given fi £ A^(Aoo), let £ = (£1, £2, . . . ) be a random element of Aoo, 

£~m- 
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Let B = {B\, B2, ■ ■ ■) be independent Bcrnoulli-p random variables, 

Let Z' = p£ + Bi£i + B 2 ^2 + ■ • • • Let Ki = minjfc : Bk = 1} and inductively 
define K n+ \ = min{fc > K n : B^ = 1}, for n £ N. Note that, a.s., such a 
sequence exists and is unique. Let us define £ = (CijCa> • • • ) such that 

Cn = ZkJZ' for n £ N, 
so that C is an element of Aoo. Then with all this 

OM := £(C). 

We say that fi € ^(Aqo) is thinning invariant if 9 p {n) = fi for all p £ (0, 1]. 
Our main goal is to characterize the class of all thinning invariant partition 
structures, which we will do as a conjecture. 

1.2 Poisson-Kingman partition structures 

An excellent reference for Poisson-Kingman partition structures is the review 
by Pitman [12]. These are also related to Lambda-coalescents as in [11]. For a 
detailed review we recommend [1]. 

Let C denote the set of all pairs (A, v) where v is a nonnegative number and 
A is a Borel measure on (0, 00) satisfying 

J min{a;, 1} dA(x) < 00 . 

We let Coo denote the subset consisting of all pairs (A, v) £ C such that one or 
both of the following conditions is satisfied 

• v > 0, or 

• A((0, 00)) = 00. 

Given (A,v) £ Coo, the Poisson-Kingman random partition structure V(a,v) G 
Ai(Aoo) is defined as follows. 

Definition 1.2 Given a complete separable metric space X, let A4+(X) denote 
the set of all (nonnegative) Borel measures, not necessarily normalized or finite, 
and let 23+ (X) denote the set of all nonnegative Borel functions. Given p £ 
A4 + (X) let PPP(X,p) £ A / J(A / J + (X)) denote the Poisson point process on X 
with intensity measure p, which is the distribution of a random Borel measure 
S £ M + {"X), specified by its moment generating functional: 

5 - PPP(X, j o) V/GB+(X), E[e-fx fd3 ] = exp^- J (l-e~ f )dp\ . 

See Chapter 7 of Daley and Vere- Jones for more details J^j 
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Now let S be a random measure on (0, oo), 

S - PPP((0,oo),A). 

We define the random variable Z = v + S((0, oo)). Almost surely there exists a 
sequence £ = (£1, £2, ■ • ■ ) with £1 > £2 > • ■ • > and such that 

00 



We define £„ = for each neN, and let f = (£i, £2, • • • )• Then 
1.3 Conjecture for partition structures 

For each iel, define the shift a x : R — > R as a x (y) — x + y for each y € K. 
Let 5^ : X + (M) -> .M + (R) denote E x (p) = p o a~ x . Let us define 

M st (M+(R)) := {Q G M(M+(R)) : Viel, = Q} . 

In other words, A4 st (A4+(M.)) is the set of stationary random measures. 

Let I • I denote Lebesgue measure on R. Let A^+,i(R) denote the subset of 
all those p G M + (M.) satisfying 

lim r 1 f p(<j-\A))dx = lim r 1 [ p{<j-\A))dx = \A\ 

for every Borel set ACM. 

Given p G M +! i(M) and given m > 0, let us first define p^ G M+(M) as 

dp^ix) = me rnx dp(x), 

and then let us define A Pi?rl G M + ((0,oo)) by 

where e_ : R — > (0, 00) is the mapping e_(.x) = eT x . 

Remark 1.3 Consider the special case p = | • |. Then A Pi . m (dx) = mx~ m ~ 1 dx 
for each m G (0,1). Then the Poisson- Kingman partition structure U/\ m ) * s 
called the Poisson- Dirichlet random partition structure PD(m,0) G A^A^) by 
Pitman and Yor This is important in spin glass theory as in \14\ EJ/. 

Let X denote the disjoint union of _M +)1 (R) X (0, 1) and {1}. Let M st (0C) 
denote the set of Borel probability measures on 3C such that for each Borel set 
A C M +A (R) x (0, 1) and each x G R, 

Q({(p,m) : (E x (p),m)G A}) = Q(A) . 
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Given Q G M Bi (OC), we define Nq G .M(Aoo) to be a special example of a "Cox- 
Kingman" partition structure, defined such that for any Borel set A C A^, 

N Q (A) = Q({l})p (0A) (A) + [ u (Apmfi) (A)dQ(p,m). 

J M+,i(BL)x(0,l) 

Conjecture 1.4 (i) For any pair Q,Q' G A4 s t(0C), if ATq =Nqi then Q = Q' . 
(ii) The set of thinning invariant [i G .M(Aoo) is precisely {A/q : Q G M s t(3C)}. 

1.4 Outline for the rest of the paper 

In the next section we are going to state the analogous problem of thinning for 
sequences. For this problem we have a rigorous characterization. This is also 
an interesting question on its own for reasons we will explain. In Section 4 we 
will state the analogous problem for "gaps." In a way this generalizes the set 
of thinning invariant random partition structures. We also obtain a conjecture, 
which we support by an argument parallel to the theorem for thinning invariant 
sequences. But there is one missing step of our argument which is establishing 
tightness of a certain sequence of measures. Finally, in Section [S] we return to 
the problem of thinning invariant random partition structures. In particular, 
we discuss the connection to Derrida's REM more explicitly than in the intro- 
duction, showing the two complementary pieces of the cavity step involving the 
energy shift that Aizenman and Ruzmaikina already solved and the entropy 
shift corresponding to thinning. 

2 Thinning-invariant sequences 

Let U be a compact, metrizable space. Let U fi denote the set of all sequences u = 
(ui,u 2 , . . . ) with ui, u 2 , ■ ■ ■ EU. Given fi G M(U n ) and p G (0, 1] we consider 
a new measure 6 P (^) in A1(W N ), defined as follows. Let U = (Ui, U2, ■ ■ ■ ) be a 
random element of W N , 

U - /i. 

Additionally, let B ~ (Bi, B 2 , . . . ) be i.i.d., Bernoulli-p random variables, 

B_\LU. 

Let K\ < K 2 < . . . be the sequence uniquely defined, a.s., by the condition 

{K U K 2 ,...} = {keN : B k = l}. 
We define V=(V 1 ,V 2 ,...)asV n = U Kft for n G N. With all this, we define 

6 P ( M ) L(V). 

We say that /1 G A4(U N ) is thinning invariant if O p (/i) = /i for all p G (0, 1]. 
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2.1 Motivation 



After we initially stated our conjecture for thinning invariant random parti- 
tion structures Dmitry Panchenko suggested characterizing thinning invariant 
sequences. Panchenko also told us of the connection to exchangeability via 
"spreadability." Ryll-Nardzewski introduced the notion of spreadability [TS]. A 
sequence is spreadable if, for every non-random subsequence ki < k% < . . . , 

{Xk t , X k2 , . . . ) = (Xl , X2, ■ ■ ■ ) • 

It is easy to see that this is equivalent to exchangeability. Our main theorem 
below will show that the notion of thinning invariance is strictly weaker than 
spreadability. 



2.2 The Poisson construction for sequences 

Let C^(R) denote the set of all strictly increasing homeomorphisms <f> : R — > R. 
Let M^(R) denote the subset of all a e A4 + (R) such that the mapping 

F a (x) = a((— oo,x)) 

is a homeomorphism of R onto (0, 00). There is a right action of C^(R) on 
A^(R): a ^aof 1 . 

Let tV1^(R x IX) denote the set of all Borel measures aonRxW such that the 
marginal a(- x U) is in .M^(R). Given a g M&(R x U) we define v a E M(U N ) 
as follows. Let H be a random measure on R x U, 

S ~ PPP(R x U,p). 

Almost surely there is a unique random sequence Ui), (X 2 , Uz),--- ElxM 
such that Xi < X2 < ■ ■ ■ and 

00 

n=l 

Let U = (Ui, U2, ■ ■ ■ ), and define 



2.2.1 Symmetry 

There is still a right action of C^(R) on A^(R xW): a4«o(^x idu)^ 1 where 
id^ is the identity mapping on hi. Inspection shows that 

^po^xidw)- 1 = V P ■ 

There are distinguished representatives for A / J^(R x W)/C^(R). 

Let A^Leb(R x hi) denote the set of all Borel measures 7 on R x hi such that 
the marginal j(- x hi) is | • |. Given 7 g .A/fLeb(R xW), we define the new Borel 
measure 7W <= A^^(R x hi) by taking 

^ x \dx x du) = e Xr y(dx x du) . (1) 
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2.3 Cox construction and characterization 

Recall that for each iglwe defined a x <E C^(M) as <J X () = ■ + x. Define 

% : X L ob(K xW)4 -M L ob(K x U) , E x ( 7 ) = 7 o x id^)" 1 . 
Then we denote the stationary distributions for random measures 

M st (M Lch (R x U)) = {Q e M(M Lcb (R xW)) :V.ieM, Qo E" 1 = Q} . 
Given Q £ M st {M hch {R x W)), we define JV~ e M(U N ): 

JM^ h (M.xU) 

for each Borel set A C U N . 

Theorem 2.1 (i) IfQ,Q' are measures in M s t(Mheb{^x U)) withAfg = Mq,, 
then Q = Q' . 

(ii) The set of all thinning invariant measures [i G A4(U ts ) is precisely 
{Mq : Q S M st (M Lch (R x «))} . 

This is the most important theorem that we can prove, rigorously. The proof is 
not difficult, but it does use a couple of nice ideas already in the literature. In 
particular, we use a model introduced by Aldous, and it seems that our proof 
is the first application of this process, which Aldous introduced apparently for 
its sheer beauty. 

3 Proof of Theorem EJ] 

Theorem 12.11 is proved by combining two ideas. The first is the idea of Pois- 
sonization as in the paper of Ruzmaikina and Aizenman |14j . The second is 
appeal to an important stochastic process that David Aldous described in his 
paper [2] related to Hoyle's steady state model. We describe the steady state 
model first. This model is a Markov process version of the thinning invariant 
point processes, which were proved to be just Poisson processes by Matthcs, 
Kerstan and Mecke [S] and Kallenberg [5]. 

Aldous's explanation of the steady state model recalls an old, now defunct 
cosmological model due to Fred Hoyle, who sought a stationary alternative to the 
big bang theory. Hoyle eventually converted to the big bang theory, himself. See 
for example, [H]. But Aldous's description is a natural Markov process, equal 
to the time reversal of thinning. We recall the thinning version first, which is 
easier. 

We take the underlying space to be [0, 00), and restrict attention to infinite, 
locally finite random point processes, meaning that the random sample 3 £ 
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A4 + ([0> oo)), almost surely may be expressed as 

oo 
n=l 

for some sequence of points satisfying < Ri < R 2 < • • • , and R n — » 00. 
Now, let Ti, T2, . . . in [0, 00) be i.i.d, random variables 

T„ ~ Exp(l) for each n e N, 

and such that 

(T 1 ,T 2 ,...)JL{Ri,R 2) ...). 
Then for each t > 0, let S t g 7W + ([0, 00)) be the random measure 

00 

S * = 22 l[0,T n )(*) $R n exp(-t) ■ 
n=l 

If H is initially a homogeneous Poisson point process, e.g., 

S ~ PPP([0,oo),||-||), 

then the process S t is stationary. This is easy to see using the moment gener- 
ating functional from Definition 11.21 

For any r € R we define the stochastic process S Tjt for t € [t, 00) started 
at time r as a Poisson point process on [0, 00) with intensity equal to Lebesgue 
measure || • |j, and evolved according to the description above. By Kolmogorov's 
extension theorem, these consistent distributions may be extended to give a law 
for a stationary process S-oo^ for all tel. 

Then one can consider the time reversed version, (H_ 0Oi _{) te g, which is the 
version Aldous described, and which corresponds to Hoyle's steady state model. 
At any given time t there is a spatial Poisson point process of particles. As one 
increases t and considers S-^-t the particles move apart at a constant rate. 
New particles are born into the universe to fill in the gaps. 

3.1 Marked version 

Now, we consider the marked version of the steady state model. Suppose that 
at time r one has an initial random point process 2 now on [0, 00) x hi. Let us 
suppose that taking the marginal on [0,oo), the point process S(- xU) is still 
a Poisson process of intensity equal to Lebesgue measure. Then almost surely 
there is a sequence Ui), (R2, U2), • • • G [0, 00) x U, such that 

00 

n=l 

and 

0<R 1 <R 2 < ... . 
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Let T\, T 2 , . . . be IID random variables 

Ti,T 2 ,... ~ Exp(l) and (Ti,T 2 , . . . ) JL ((i?i, I/i), (ifc, E/b), . . . ) . 

Then for t € [r, 00), define the random point process S Tjt such that for any 
Borel measurable set A C [0, 00) x W, 

00 

3 T , t (4) - ^l [0iTn) (t-r)l A (e r -*i?„,C/„). 

n=l 

The marginal S Tjt (- x W) is the steady state model as before. 

There are two important observations to remark upon. Firstly, suppose that 
S = S r ^ T has some distribution as described above. Let U = (Ui, U2, ■ ■ ■ )■ Let 
us define p in A4(W N ) to be the distribution 

p &JC(U). 

At any time t e [r, 00), we may also almost surely find a sequence 

((iJi(t),^i(t)),(i22(t),^ 2 (t)),...) 
such that < Ri(t) < R 2 (t) < . . . and for any Borel set A C [0, 00) x U, 

00 

S T , t (A) = ^l A (i?„(i),f/„(i)). 

n=l 

Then, defining £/(i) = U2(t), . . . ) and // T)t as 

Mr,t := -C(tf(t)), 

we have 

Mr,t = ©exp(i— i) (m) . 

Building on this, we make the second observation. Let Mheb([0, 00) x U) 
denote the set of all Borel measures p on [0, 00) x U such that p(- xU) = \ ■ \ . 
Given p in this set, let us consider 5 to be the random point process 

E ~ PPP([0,oo) xU,p). 

Then it satisfies the conditions above. More importantly, by the thinning prop- 
erty of Poisson processes, 

Er >t ~ PPP([0, M )XW,$ ( _ T (/))), 

where for each t e [0, 00), the mapping $ t : A4([0, 00) x U) -> A4([0, 00) x U) is 
non-random: let <f> t : [0,oo) — > [0, 00) be the mapping 4>t{s) = st, and then 

®t(p) = e _ *p o (</>cx P (-t) x id^r 1 • 
By inspection, $ t sends the subset Mheb([0, 00) x U) back to itself. 
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3.2 Poissonization 

Given p £ A^Leb([0, oo) x U), let us denote a measure v p by taking 

5 - PPP([0,oo) xU,p), 
which almost surely may be written as 

oo 
n=l 

for a sequence (i?i, Ui), (R2, U2), ■ ■ • G [0, 00) x U with < R\ < R2 < ■ ■ ■ , and 
then defining 

d p ?= L{U U U2,...). 
Given Q e M (A^Leb([0, 00) x U)), we define Nq such that 

N Q (A) = f D p {A)dQ(p), 

^A^Leb([0,Oo)xW) 

for every Borel set A C U N . 

Lemma 3.1 Suppose that p, € A4(U N ) is thinning invariant. Then there is a 
Borel probability measure Q € Ai (A^Lcb([0, 00) x hi)), such that p = Nq. 

The proof is simple. We use an idea from [14]. The idea of Poissonization is 
natural in these problems. 

Proof: Suppose p is thinning invariant. Let U = (Ui, 1/2,...) be random 
and U ~ p. Independently, let R2, ■ ■ ■ ) be distributed according to a stan- 
dard Poisson point process on [0,oo). Take 

00 

„ def v j. 

n=l 

This is not Poissonian, but we will push it back in time to —00 to obtain 
something asymptotically Poissonian at time 0. 

Consider the following approximation. Let S' be a random point process, 
such that, conditional on H, we have 

S' - PPP([0,oo) X U,E). 

Then for any r s R and t € [r, 00), the process described before as S^. t is 
essentially the same as S Tj t with the following difference: In S Tj t each original 
point receives a Bernoulli random variable with survival rate p = exp(r — t) 
to determine if it remains or is deleted; whereas, in S^. t , each original point 
is copied a Poisson number of times, with rate p = exp(r — t). We want to 
compare S t and S^. in the limit where r — > —00. 
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Suppose n is a fixed integer. If we keep track of the first n/p of the original 
points of S, whether we act on them according to Bernoulli thinning or "Poisson 
thinning" we see on average n survivors which is an order- 1 quantity. More 
precisely in the first setting the number of survivors is binomial with parameters 
(n/p,p) and in the latter case it is Poisson with parameter n. By the usual law 
of small numbers the number of survivors converge in distribution in the limit 
p — > 0. Note that if we take t = then p = exp(r) so that p — > as r — > — oo. 

In addition to just keeping track of the number of points, we should also 
check the point-values themselves. According to the weak topology, if we can 
couple each of these processes with a small failure rate, then that suffices to show 
convergence to zero of the difference in the limit as r — > — oo. The failure rate 
for the Poisson-to-Bernoulli coupling is proportional to the probability of having 
two or more survivors according to any of the n/p original Poisson variables. 
Since each Poisson random variable has rate p, and since the probability of two 
or more points is on the order of p 2 , this shows that the overall failure rate is 
bounded by a constant times np. This converges to zero for each fixed n G N, 
in the limit as p — > 0. 

Since we have shown that the distributions of S Ti o and HJ. are asymp- 
totically close in the limit r — » — oo, we now want to check that there is a 
limit-point for the distribution which has the desired properties. One can 
take the random meaure p = S so that, conditional on this value 

E' ~ PPP([0,oo) xU,p). 

Note that, conditioning on the random value of p — S, we have 

s; ~ PPP([0,oo) xU,$- r (p)). 

Now by the strong law of large numbers, one can see that for almost all choices 
of p, 

lim $_ T (p)(A x U) = \A\ . 

T— > — OO 

This is just a fact about the original steady state model because we are taking 
the marginal on [0, oo). For instance $_ T (p)([0, x) x U) just measures 

e T \{n : R n < xexp(-r)}| , 

for the original Poisson point process (i?i, i?2, . . . ) on [0, oo). From this, we see 
that if there is a limit-point for the distributions of $_ T (/?), along some subse- 
quence (ri,T2, . . . ) with r n — > — oo, then this limit will be a Borel probability 
measure Q which is supported on 7WLeb([0, oo) x U). 

To guarantee that a subsequential limit point exists, we just need to check 
tightness. (See, for example, [7]-) One may break up [0, oo) into countably 
many intervals [n — 1, n) for n G N. Thus we may consider A^ + ([0, oo) x K) as 

Y[M+([n-l,n)xU), 
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and in fact the product topology is appropriate for the local weak topology on 
point processes on [0, oo) x U. In the last paragraph we already established con- 
vergence to a Poisson point process for the marginal on [0, oo), which guarantees 
tightness for the marginal on each interval [n — l,n). Since 14 is compact the 
marginal on U is automatically tight. Also the product topology of any number 
of compact sets is also compact. This guarantees tightness overall. □ 

3.3 Proof of Theorem [27T] part (i) 

Suppose p G A4(U N ) is thinning invariant. Then, according to Lemma l3.ll there 
is some Borel probability measure Q supported on Mhcb{[0, oo) x U) such that 
fi = Nq . By the comments at the end of Subsection 13.11 it is clear that for any 
r < 0, we could replace Q by Q\ T m, the ergodic average 

Q M] (o •= J- y Q $-\.)dt. 

One still has the desired condition that u = A77 

V[ T ,0] 

The total variation norm of (Q[ T o] ° — Q[tO]) * s bounded by t/\r\. By 
tightness/compactness, we may find a sequence (ri, T2, ■ ■ ■ ) with r n — > — oo such 
that Q\ Tn o] converges. Therefore, we obtain Q* , a Borel probability measure 
supported on A^Lob([0,oo) x U) such that Q* o $ t _1 = Q* for every t € [0, oo), 
and such that /i = A/q* . For notational convenience, we will assume that this 
measure was chosen at the outset: Q — Q* . 

With this, we can set-up a steady state model on [0, oo) x U with stationary 
measure given by first selecting a density p € A^Leb([0, oo) x U) distributed ac- 
cording to Q, and then taking the random point process S, such that conditional 
on p, 

E ~ PPP([0,oo) xU,p). 
Note that for r G M and t € [r, oo), conditional on p, we have 

E Ttt ~ PPP([0,co) x«,$ t . r W). 

But since p ~ Q and since Q ° = f° r au * ^ [0, oo), this means we still 
have 

So the overall distribution, averaging over p, is invariant. By Kolmogorov's 
extension principle, we can now extend to obtain the model when r — > — oo. 

3.3.1 Uniqueness 

Suppose that S_oo j is this process. In other words, at each time i, it is a 
point process on [0, oo) x U: its marginal distribution at any given time is as a 
Cox process with mixing measure Q, and it evolves as t increases as indicated 
above. We may define J 7 /^^-! to be the cr-algebra of (S_ oot ) t < r . Then, for any 
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measurable set A C [0, oo) x U, we may consider the reversed (or backwards) 
Doob's martingale 

E[H_oo,o04) l^-oo-t]] for t€[0,oo). 

According to Doob's reversed martingale convergence theorem, we know that 
this converges, a.s, and in expectation. (See for example [7], Section 5.6.) Let us 
call the limit p(A). Note that this is measurable with respect to J--oc, the back- 
wards tail c-algebra. On the other hand, using the notation (R n (t), J7„(t))^_j 
introduced before, we see that for t <G [0, oo) the conditional distribution of 
S_oo,o (A) given 

— OO,— t] is 



E 



XE^o(A) Tf ,] = e e%(S_ 00) _ t )(A)ln(l-(l-e- A )e- t ) > (2) 



(— oo,-t] 



Using @ in a first pass, one may see directly that 

E[H_oo,o(j4-) l^-oo,-*]] = ^(H-oo,-*)^). 

Using this fact, and making a second pass at ©, we see that, conditional on 
J--OC, ELqoo^) is distributed as a Poisson random variable with parameter 
p(A). 

To gain insight into the distribution of p, note that by stationarity of S-^ j, 
the random measure $ t (S_ OQ _ t ) has the same distribution as <fr t (^~oo .o)- But 
the distribution of S_ co is obtained as follows: let p be random with p ~ Q, and 
conditional on this S-oo,o ~ PPP([0, oo) xU,p). Moreover, since Q o ^ 1 = Q. 
we may make yet another change and still have the same distribution. Choose 
p randomly with p ~ Q, and then conditionally on this, for each t G [0, oo), let 
St be random with 

St - PPP([0,oo) xU,$- t (p)). 

Note that $t is well defined for t < as well as for i > 0. 

Then the overall distribution of ^((St) is still the same as ^((S-oo^), after 
averaging over p as well as the conditional Poisson measure, given p. On the 
other hand, we claim that, conditional on p, we almost surely have convergence 
$ t (S t ) —> p in distribution. To see this, note that for any / € 2? + ([0, oo) x IX), 



E 



f /d$ t (S_oo, t ) 

J[0,oc)xU 



exp / (l-exp(-e * /)) e* dp 

W[0,oo)xW ; 



using Definition 11.21 and the definition of $ t . By the monotone convergence 
theorem, this converges to exp(— J f dp) which is the moment generating func- 
tion for p, and this uniquely characterizes the random measure. This implies 
that the distribution of p is Q. Since p was obtained from the process, itself, 
using the martingale convergence theorem, this is the usual proof of uniqueness 
of Q (similar to the proof of uniqueness for the directing measure of infinite 
exchangeable sequences of random variables in de Finetti's theorem [T]). 
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3.4 Births and a change of variables 

We now consider the steady state model we have been considering in reversed 
time (S_ oc __ t ) teR . This was the perspective taken by Aldous. In this perspec- 
tive new points are being created by a space-time Poisson process with marks. 
In the thinning perspective these would be extinction times for particles com- 
ing from — oo. From this perspective, S_oo,_ t (A) = H(^ t (A)), where H is a 
space-time-marking point process and 

V t (A) = {(s, re- 1 - 3 , u) : (r, u) € A , s€ [-t, oo)} . 

One may determine H by starting with the distribution of and then 

performing deletions, and keeping track of the points as they are deleted. 

At a time t, the distribution of points is E-o^t which is Poisson on [0, oo) x U, 
with intensity $t(p), where we are conditioning on F-oo, continuing with the 
analysis of the last section. Each such point is deleted at uniform rate. So 
in an infinitesimal time interval [t,t + dt), the distribution of deleted points is 
$i (p) (dr x du) dt. But a Poisson process has the thinning property: separating 
a Poisson point process into two groups according to an independent Bernoulli 
process both groups are independent Poisson processes with certain rates. Using 
this and the infinitesimal generator of thinning $ t (p)(dr x du) dt, we see 

H~ PPP(Mx[0,oo)xW,$G5)) where $(p)(dtxdrxdu) = <5> t (p)(drxdu) dt . 

Let us rewrite this, using the definition of $4: 

S(p)([ti,t 2 ) x [n,r 2 ) x A) = [' e- t p([e t r u e t r 2 )xA)dt. 

Jti 

Now we will do a change of variables. The calculations are direct, but not trans- 
parent, so we will perform a consistency check. We will recover the marginal 
density E.^ from H using the new variables. 

Consider the function C(t, r) = (t, ln(r) + t) from R x (0, 00) to M x R. We 
may consider rectangles 

$(p)(£- 1 ({t u t 2 ) x [xi,x 2 )) x B) = I e- t p([exp(xi),exp(x 2 )) x B)dt, 

Jti 

for any Borel set B C.U. Note that the inverse mapping is £{t,x) — {t 1 e x - t ). 
Therefore, 

*t([ri,r 2 ) x B) = C-\[-t,oo) x [ln(n) - t, ln(r 2 ) - t)) x B . 
Hence, the expectation of !E_ 00i _ t ([ri, r 2 ) x B) = H(^ t ([n, r 2 ) x B)) is 

/oo 
e- s p([r 1 e t ,r 2 e t )xB)ds = e*?([rie*, r 2 e t ) x B) , 

which is <&_t(p)([ri, r 2 ) x B). This is as it should be since $_ t (p) is the intensity 
(conditional on J"_oo) for the Poisson point process 'B- 00i - t , given p. This is a 
consistency check that we have applied the inverse functions £(t, r) and £(t,x) 
correctly to the sets and the measures. 
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3.4.1 Proof of Theorem l27fl part (ii) 

Now let us consider H o (£, x id^) _1 . If we define 7W 6M + (RxW)by 

7 (1) ([a;i,a;2) x B) = p([exp(a;i), exp(ac 2 )) x B) , 
then notice that 7^ ([2:1, £3) x W) = J^ 2 e x dx. Therefore, 

7 < - 1 )(g?:e x du) = e x j(dx x c?u) , 
where 7 is in .MLeb(R xW). The calculations of the last subsection show that 
H o (£ x id^)" 1 - PPP(R xKxW,t)X7), 

where 77 is the meaure on R: dr)(t) — e~* dt. 

Using the fact that H-o^-tQn,^) X B) = H(^ t (Vi, ^2) x £)), we see that 
stationarity of (S_ 00j _ t ) t6 u amounts to stationarity in t of the marginal distri- 
bution of 

Ho£- 1 ([-t,oo)x[x 1 -t,x 2 -t)xB), (3) 

averaging over both the conditional Poisson distribution given 7, and over the 
induced measure on 7. Let us define Q to be the distribution of the random 
measure 7 coming from Q by pulling back the transformation p 1— > 7. Then 
thinning invariance at the level H ' = H o (£ x id^) _1 as in ([3|) is stationarity 
with respect to the shift: a t :Rxl given by i?t(s, x) = (s + t,x + t). Note that 
the Poisson point process H' has an intensity which may written using 7: 

drj(s) x x \dx x du) = e x ~ s "f(dx x du)ds. 

This satisfies the property that 

(77 x 7 (1) ) o (a t o iduy 1 = rj x [7 o (a t x id w ) _1 ] (1) . 

Therefore, stationarity of (5_ COi _ t ) t£ R actually amounts to stationarity of the 
Cox process on M. x U, relative to the usual shifts (a x x idy), where the the 
directing measure of the Cox process is Q. But this is equivalent to stationarity 
of the directing measure Q. See for example, Daley and Vere- Jones [5], Section 
10.1, Exercise 10.1.3, which is an easy exercise in using the definition of the 
moment generating functional as in Definition 11.21 This is the characterization 
that was to be proved. So part (ii) of the theorem is completed. 

4 Thinning for Gap Distributions 

Let r C [0, oo) N consist of all sequences w = (w\, W2, ■ ■ ■ ) satisfying 

< u>i < w% < .... 

We call these configurations of gaps, for a reason which will be apparent mo- 
mentarily 
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Given /j, E -M.(T) and p e (0, 1] we consider a new measure 9 p (n) in A4(T) 
denned as follows. Let W = (W\, W2, ■ . ■ ) be a random element of T, 

W ~ n. 

Define Wo = 0. Additionally let B = (B , Si, ... ) be i.i.d., Bernoulli-p random 
variables 

B _LW. 

Let JsTo < K 2 < ■ ■ ■ be the subsequence of {0, 1, ... } uniquely defined, a.s., by 
the condition 

{K ,K U ...} = {fce{0,l,...} : B k = l}. 

Let Vo — and V n — (Wk„ — Wk ) f° r n G N. These are gaps of the thinned 
sequence starting from (Wo, W\, . . . ). Let us define V — (Vi, V2, ■ . ■ ), which is 
an element of T, a.s. Then 

4.1 Poisson construction for gap distributions 

Let Aif(M) denote the set of all measures a € A4 + (M), satisfying 

• a((— 00, x)) < 00 for each x € M, and 

• a(R) = 00. 

Given a e A^f(M) we define v a € .A4(r) as follows. Take S a random point 
process on M with 

E - PPP(R,a). 

Almost surely, there is a sequence — 00 < X\ < X 2 < ■ ■ ■ , with 

00 

71=1 

Let U n = (X n+1 — X n ) for n e N, and note that U — {U\, U2, ■ . ■ ) is an element 
of T, a.s. We define 

V a ~L(U). 

It is easy to see that v aoa -i = v a fo all x € M. Also, 

Note that (modulo shifts) the only measures a € .MfQR) satisfying 

pa = ao <j-{ p) , 

are ot m (dx) = me mx dx for m G (0, 00), which each leads to a thinning invariant 
gap distribution v am . But the limit lim m j,o v am also exists, and is the point 
mass at (0, 0, ... ) € T. We denote this as Poo5 . These are the "pure" Poisson 
gap distributions which are thinning invariant. 
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4.2 Cox construction and conjecture for gaps 



While we have classified the "pure" Poisson gap distributions which are thin- 
ning invariant, one may generically perturb each of these by some version of 
stationary Cox processes to obtain something which is still thinning invariant, 
in complete analogy to the case of thinning invariant sequences. 

Let DC' denote the disjoint union of Al+,i(R.) x (0, oo) and {0}. Given 
(7, to) € A4+,i(R) x (0,oo), we define 

^ rn \dx) t= me mx 1 {dx). 

(Note that this is consistent with the definition ([T]) in the case m = 1.) We 
define A4 s t(0C') to be the set of all measures Q e M.{%') such that for each 
Borel set A C M+,t(R) x (0, 00) and each x € K 

Q({( 7 ,to) : (E x ( 7 ),m)eA}) = Q(A) . 
Given Q e M s t(3C) we define A/q G M{T) such that for any Borel set ACT, 

ti Q (A) = Q({0})Poo5 (A) + f P 7(m ,(A)rfQ( 7 ,TO). 

JM + ! i(l)x(0,oo) 

Conjecture 4.1 fi) For any pair Q,Q' G A1 s t(r) i/A/q = Nq> then Q = Q' . 
(ii) The set of thinning invariant /j, € M(T) is {Mq '■ Q € M s t(T)}. 



4.3 Heuristic argument for gaps 

We will outline an argument for Conjecture 14. 1[ paying special attention to the 
most important step that we cannot rigorously prove. 

Suppose that [i is a thinning-invariant gap distribution. We may choose 
W ~ fx where W = (Wi,W 2 , ■ ..)■ We define a point process S = Yl^Li $W n - 
Now, let Ti,T 2l . . . be independent Exp(l) random variables 

(Ti, T2, . . . ) JL (Wi, W2, • . . ) , 

and define S t = X^^Li ^-[o,T n )(t) o"w n ■ Then modulo a shift this is the thinning of 
So- Therefore, there is a random constant X(t) such that S t o cr^ 1 is stationary. 
From the semi-group property of deletions, it is easy to see that X t itself is 
stationary. Therefore, by the ergodic theorem, there is a random constant M 
such that 

lim — = l/M. (4) 

i^oo t 

(See for example, Chapter 7 of [7]-) In order to simplify the situation, we will 
suppose that fj, satisfies the property that M is non-random, /i-almost-surely, 
and consider this value to be to. We then shift the originally defined point 
processes: 

Z' t = St o a~f m . 
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We have taken care of the leading order effect of the shift. This is where we 
need to make a non-rigorous jump. 

Conjecture 4.2 (i) The case M = is not possible in |7J). 

(ii) For M e (0,oo], define [Ni(t), N 2 (t), . . . ) such that Ni(t) < N 2 (t) < ... 
and (iVfc(t))^ =1 = (n : T n < t) for each t. Then the distribution of {Wisr k (t) ~ 
m t) is tight in t for each k G N. 

Let us consider m £ (0, oo) first, before considering m — oo. We just quickly 
repeat the steps from the proof of Theorcm l2.11 making the appropriate changes 
for the present situation. 

We may start at time t: E' t t is S t _ T for t > r. Then by tightness, we 
may take a subsequence r„ with r„ — > — oo such that the weak limit of the 
entire distribution of E' T t converges for all t € R. Then we use Kolmogorov's 
extension theorem to construct S'^^ t . Then we may take the ergodic/Cesaro 
limit to obtain a stationary version which we assume we had done from the 
outset to avoid complicating the notation. (Note that shifting by any times 
does not affect the marginal distribution of gaps under the thinning-invariant 
assumption.) One can then consider the process in reverse. 

Now, conditional on the backwards tail-algebra F-oo, particles are born 
(backwards in time) according to a space-time process H ~ PPP(R x E,po 
a^ 1 m (dx)dt) where p is obtained by keeping track of particles in space-time, 
streaming at the constant velocity 1/m. After birth the particles drift to the 
right at this velocity 1/m. Stationarity means that the new particles are filling 
in gaps caused by the drift. Although the drift is constant in space, therefore 
preserving Lebesgue measure, recall that the pure Poisson construction for pa- 
rameter m is based on a Poisson process with intensity me mx . So that the 
particles are more dense to the right, which means if the entire point process 
drifts to the right, then the gaps become relatively bigger (since they are the 
gaps that were previously on the left). We claim that stationarity of E' t implies 
that p = 7^ for 7 stationary (with respect to shifts). We leave this as an ex- 
ercise similar to Section 13.41 rewriting stationarity of S£ in terms of properties 
of the Cox process H, and then using the exercise from Daley and Vere- Jones 
to relate stationarity of the Cox process to stationarity of its directing measure. 

Notice that for m £ (0, 00) , equation (j4)) should imply that W n — Wo / In n — > 
1/m. One can see this by noting that Ni(t) is geometrically distributed with 
failure rate p — e _t . Therefore this claim follows by a suitable Tauberian 
theorem to de-Geometrize. We would like to refer the reader to 0], Section 1.5, 
for important uses of Tauberian theorems in the context of coalescents, which 
are illuminating. 

If m = then tightness of (W / Ar 1 (t))teR implies tightness of (Wi,W2,...) 
(using another Tauberian argument, as is necessary). But due to monotonicity 
this implies there is a limsup, and then thinning-invariance implies that W = 
(0,0,...), a.s. 

We now make an important remark. 



18 



Remark 4.3 The conjecture should not be trivial. Our proof of Theorem \2.1\ 
used one idea from the paper of Ruzmaikina and Aizenman \14V -' Poissonization. 
But their characterization of gap distributions invariant under uncorrelated cav- 
ity steps involves other arguments whose analogues we have not yet found. In 
principle the present problem should require an equal amount of work. 

5 Return to partition structures 

One can move from partition structures to gap distributions by replacing £ = 
(£1 >&>••• ) with W = (Wi, W 2 , . . .) with W n = ln£i - ln£„+i for n £ N. In 
thinning the partition structure one rescales £. At the level of W this amounts to 
a particular random shift of (— ln£i, — In £2, • ■ • )■ But the gaps are defined from 
the perspective of the leading point (which is In £1 , but we find it convenient to 
introduce a reflection). So any thinning invariant partition structure leads to a 
thinning invariant gap distribution. 

Note that the proportion of dust in a random partition structure is non- 
decreasing under 9 P and it is constant only if the total amount of dust is or 
1. Therefore a thinning invariant random partition structure has an amount 
of dust {0,1}, a.s. Full dust is ^(o.i) ■ Otherwise, with no dust, the sequence 
W can be used to fully recover £, where £1 is determined by the normalization 
condition. 

We can then use the conjectured characterization of the thinning invariant 
gap distributions to find the thinning invariant random partition structures. 
But due to the normalization condition this requires m £ (0, 1). The limit as 
m — > 1 leads to f(o,i) • With this, Conjecture 11.41 follows from Conjecture 14. II 

5.1 Relation to REM 

Let B n (t) be independent Brownian motions for n £ N, centered (and with 
variance equal to t at time t). Consider weights -Xjv, n (/3) = e~^ Bn ^ N > for n = 
1, . . . , 2^. Then the REM partition function is 

2 N 

Z N ((3) = - ln^2x N , n ([3). 

71=1 

One observation is that for n £ {1, . . . , 2^}, we have 

lnX N+hn ([3) = -f3[B n (N + l)-B n (N)}+]nX N , n (/3), 

and [B n (N + 1) — B n (N)} is independent of Xn^ u . This makes contact with the 
uncorrelated cavity step dynamics introduced by Aizenman and Ruzmaikina. 
But additionally, we have 2 N new points at time step N + 1: A"at +1 „(/3) for 
n = 2 N + 1, . . . , 2 N+1 , and we mentioned in the introduction how this leads to 
the notion of thinning. 
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Suppose that £ 6 Aoo is a sample of a random partition structure. Let us 
define two random constants: 



i / 00 



-/8Z„ 



-pz 0] 



where Zq, Zi, 



and 



are independent A/"(0, 1) random variables 
(Z ,Zi,...) JL £, 

S (0 = ln (E^S„) , 



where B 1 ,B 2l ■ ■ ■ are independent Bcrnoulli(l/2) random variables independent 
of £. Then one would expect that the free energy of the REM 



clef 



?(/3) lim -—Z N {fi) 



N 



1 



/3iV 



satisfies 



J= max Eh^SfO + £(01 



(5) 



We would also hope that the arg-max is unique and is given by the limiting 
distribution of the random partition structure 



X 



N 



,0,0,. 



where X^\/3) > 



\ z N (p) Z N (P) 

<2 N ) 

> X N ((3) are the order statistics 



The reason for taking the maximum instead of the minimum is a deep insight 
of Giorgio Parisi which we will not discuss here. (See for example [ID].) 

There are actually variational principles for the REM, but we do not know of 
this statement appearing before, although we also do not have a proof of this. On 
the other hand, for the special case of Poisson-Dirichlet distributions PD(m, 0) 
one can do the calculations explicitly. This follows because for any IID random 
variables W\,Wz,--' > 0, independent of £ ~ PPP((0, 00), mi" ra_1 dx), we 
have 

00 00 
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See, for example, |14j for an informative proof of this important stability prop- 
erty. This implies that 

E[£(0] = -(/3m)- 1 lnE[e-' 3mZl ] = , and 

E[S(0] = m^kxlB?] = m _1 ln2. 
So restricting attention to the Poisson-Dirichlet distributons we would have 
5Y/3) = max (-((3m)- 1 In 2 - /3m/2) . 

me(0,l] v ' 

This gives the correct answer of course: for /3 < f3 c — \/2 In 2, one takes m = 1 
and for (i > (3 C one takes m = /3 c //3 to obtain 




-p~ l In 2-/3/2 for£</3 c , 
-/3 C for /3 > /3 C . 



Also, this leads to the correct partition structure PD(m(/3), 0). Of course, we 
have heavily handicapped the problem since we already know that this is the 
solution obtained by Derrida [5] and is even close to the original ideas. On the 
other hand, it would be interesting to know if (O is true if one considers the 
full class M{/±oo)- 
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